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ABSTRACT 

Queues,  stacks  (pushdown  stores),  and  tapes  are  storage  models 
which  have  direct  applications  in  compiler  design  and  the  general  design 
of  algorithms.  Whereas  stacks  (pushdown  store  or  last-in-first-out 
storage)  have  been  thoroughly  inves'igated  and  are  well  understood,  this 
is  much  less  the  case  for  queues  (first-in-first-out  storage).  This  paper 
contains  a  comprehensive  study  comparing  queues  to  stacks  and  tapes. 

We  address  off-line  machines  with  a  one-way  input,  both  deterministic 
and  nondeterministic.  The  techniques  rely  on  algorithmic  information 
theory  (Kolmogorov  Complexity). 
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1.  Introduction 

Queues,  stacks  (pushdown  stores),  and  tapes  are  storage  models  which  have  direct  appli¬ 
cations  in  compiler  design  and  the  general  design  of  algorithms.  Whereas  stacks  (push¬ 
down  store  or  last-in-first-out  storage)  have  been  thoroughly  investigated  and  are  well 
understood,  this  is  much  less  the  case  for  queues  (first-in-first-out  storage).  In  this  paper 
we  present  a  comprehensive  study  comparing  queues  to  stacks  and  tapes.  We  address 
off-line  machines  with  a  one-way  input.  In  particular,  1  queue  and  1  tape  (or  stack)  are 
not  comparable: 

(1)  Simulating  1  stack  (and  hence  1  tape)  by  l  queue  requires  Cl(n4^/\ogn  )  time  in 
both  the  deterministic  and  the  nondeterministic  cases. 

(2)  Simulating  1  queue  by  1  tape  requires  fl(n2)  time  in  the  deterministic  case,  and 
Q(n  4//3/Iogn )  in  the  nondeterministic  case; 

We  further  compare  the  relative  power  between  different  numbers  of  queues: 

(3)  Nondeterministically  simulating  2  queues  (or  2  tapes)  by  1  queue  requires 
Q(n2/(log2n  loglogn ))  time  and  deterministically  simulating  2  queues  (or  2  tapes) 
by  1  queue  requires  f2(n2)  time.  The  second  bound  is  tight.  The  first  is  almost 
tight. 

(4)  We  also  obtain  the  simulation  results  for  queues:  2  nondeterministic  queues  (or  3 
pushdown  stores)  can  simulate  k  queues  in  linear  time.  One  queue  can  simulate  k 
queues  in  quadratic  time. 

It  has  been  known  for  over  twenty  years  that  all  multitape  Turing  machines  can  be 
simulated  on-line  by  2-tape  Turing  machines  in  time  O(nlogn)  [HS2],  and  by  1-tape 
Turing  machines  in  time  O  (n2)  [HU],  Since  then,  many  other  models  of  computation 
have  been  introduced  and  compared.  (See  [Aa,  DGPR,  HSl,  HS2,  HU,  LS,  PSS,  Pa, 
Vi2j.)  In  addition  to  different  storage  mechanisms,  real-time,  on-line  and  off-line 
machines  have  been  studied.  An  on-line  machine  is  expected  to  give  an  answer  after 
reading  each  prefix  of  the  input.  In  this  paper,  we  consider  the  off-line  machines,  where 
an  answer  is  given  only  once  the  whole  input  has  been  read.  We  also  use  the  one-way 
input  convention,  where  the  machine  has  a  one-way  input,  a  finite  control  and  access  to 
some  storage. 

The  relative  power  of  stacks  and  tapes  is  more  or  less  well  known.  For  example, 
for  the  nondeterministic  case,  we  know  that  1  stack  <  1  tape  <  2  stacks  <  3  stacks  = 
k  stacks  =  k  tapes,  where  A  <B  means  that  B  can  simulate  A  in  linear  time,  while  A 
cannot  simulate  B  in  linear  time.  In  most  of  the  cases,  close  lower  and  upper  bounds 
are  known  for  the  simulation  [Ma,  Lil,  Vil.  LV,  Li2]. 

fn  this  paper,  we  give  a  complete  characterization  of  (off-line)  queue  machines.  The 
main  theorems  show  that  one  queue  machines  are  not  comparable  to  one  stack  or  one 
tape  machines,  both  deterministically  and  nondeterministically.  We  also  compare  the 
relative  power  of  machines  having  different  number  of  queues.  We  use  Kolmogorov  com¬ 
plexity  techniques  [Ko,  Ch,  So]  to  prove  the  theorems,  together  with  some  new  tech¬ 
niques  to  enable  us  to  deal  with  queues.  The  Kolmogorov  complexity  of  a  string  x, 
f\  {/  ).  is  the  length  of  the  shortest  program  printing  the  string  i  By  a  simple  counting 


argument,  we  know  that  there  are  strings  z  of  each  length  such  that  K(x)>  \  x  \  . 
These  strings  are  called  incompressible  or  K-random. 

In  section  2,  we  introduce  the  jamming  lemma  which  is  used  in  further  proofs.  In 
section  3,  we  show  that  deterministically  simulating  a  queue  by  a  tape  takes  quadratic 
time  (infinitely  often).  (For  the  lower  bound  on  the  simulation  time  of  1  queue  by  1  tape 
in  the  nondeterministic  case,  see  [Li3,  LV].)  In  section  4,  we  have  a  lower  bound  for  non- 
deterministically  simulating  a  stack  by  a  queue.  In  section  5,  we  present  lower  and 
upper  bounds  for  simulating  k  queues  by  2  queues  or  2  queues  by  one  queue. 

2.  The  Jamming  Lemma 

In  this  section,  we  are  concerned  with  one-tape  and  one-queue  off-line  TM’s  where 
the  Turing  machine  has  one  1-way  input  tape  in  addition  to  one  work  tape  or  one  queue, 
each  with  one  head.  We  will  call  the  input  tape  head  A]  and  work  tape  head  or  queue 
head  A2.  We  say  that  a  poll  occurs  hen  the  head  h\  moves  one  cell.  At  any  time  t, 
h{  (t )  denotes  the  position  of  head  on  its  tape. 

In  the  following  lower  bound  proofs,  the  input  will  be  separated  into  blocks.  We 
will  observe  the  behavior  of  the  machine  as  the  head  polls  the  successive  cells  in  a  block. 
Although  the  definitions  and  the  Jamming  Lemma  are  stated  with  respect  to  one-tape 
TM’s  for  simplicity,  they  also  apply  to  one-qu®ue  machines  where  the  work  tape  is 
replaced  by  a  queue. 

Definition  2.1:  Let  z,-  be  a  block  of  input,  and  R  be  a  tape  segment  on  the 
storage  tape.  We  say  that  M  maps  *,■  into  R  if  Aj  never  leaves  tape  segment  R  while 
h1  is  reading  xt .  We  say  M  maps  *,■  onto  R  if  A2  traverses  the  entire  tape  segment  R 
without  leaving  R  while  h  x  reads  z, . 

Definition  2.2:  A  crossing  sequence  (c.s.)  associated  with  the  boundary  between 
two  contiguous  work  tape  cells  is  a  sequence  of  ID’s  of  the  form  (M(t  ),Ai(0)>  where 
M(t)  is  the  state  of  the  machine  at  time  t,  for  each  time  t  when  the  machine  crosses 
that  boundary. 

We  prove  an  intuitively  straightforward  lemma  for  one-tape  machines  with  one-way 
input.  The  lemma  states  that  M  cannot  poll  too  many  input  symbols,  with  A2  located 
on  a  given  small  tape  segment  bordered  by  short  c.s.’s,  without  losing  some  information. 
Formally: 

Jamming  Lemma.  Let  the  input  string  start  with  z#  =  xxx2  ■  ■  ■  z*  #  ,  with  the 
x,  ’s  blocks  of  equal  length  C .  Let  R  be  a  segment  of  M ’s  storage  tape  and  let  l  be  an 
integer  such  that  M  maps  each  block  xiif  ...  ,  z,-(  (of  the  z,-  's)  into  tape  segment  R  .  The 
contents  of  the  storage  tape  of  M ,  at  time  t#  when  A1(f^)  =  j  z#  |  and 
Aj(f#-1)  —  \  x  \  ,  can  be  reconstructed  by  using  only  the  blocks  z;-  •  •  •  zy  which 
remain  from  xx  ■  ■  ■  xk  after  deleting  blocks  xit,  ...  ,  xi(,  the  final  contents  of  R  ,  the  two 
final  c.s.  ’s  on  the  left  and  right  boundaries  of  R ,  a  description  of  M  and  a  description  of 
this  discussion. 
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Remark  2.3:  If  we  want  to  give  a  description  of  a  sequence  of  different  strings  of 
variable  length,  we  use  self-delimiting  strings,  adding  O  (logn  )  bits  for  each  string  of 
length  n  . 

Remark  2.4:  Roughly  speaking,  if  the  number  of  missing  bits  £  ‘  , t  \  Xj  |  is 
greater  than  the  number  of  added  description  bits  then  the  Jamming  Lemma  implies 
that  either  r=zI  ■  •  •  xk  is  not  incompressible  or  some  information  about  x  has  been 
lost. 


Proof  of  the  Jamming  Lemma.  Let  the  two  positions  at  the  left  boundary  and 
the  right  boundary  of  R  be  Ir  and  tr  ,  respectively.  We  now  simulate  A/  Put  the 
blocks  Xj  of  Xj  •  ■  ■  x jt  (  in  their  correct  positions  on  the  input  tape  (as  indicated  by  the 
hl  values  in  the  left  and  right  c.s.’s).  Run  M  with  /i2  staying  to  the  left  of  R  .  When¬ 
ever  /to  reaches  point  Ir  ,  the  left  boundary  of  R  ,  we  interrupt  M  and  check  whether 
the  current  ID  matches  the  next  ID,  say  IDit  in  the  c.s.  at  Ir  .  Subsequently,  using 
IDi+x,  we  skip  the  input  up  to  and  including  Ai(fj+1),  adjust  the  state  of  M  to  M(?t+1), 
and  continue  running  M.  After  we  have  finished  left  of  R  ,  we  do  the  same  thing  right 
of  R  .  At  the  end  we  have  determined  the  appropriate  contents  of  M’s  tape,  apart  from 
the  contents  of  R  ,  at  h  (i  e->  the  time  when  Aj  reaches  #  ).  Inscribing  R  with  its  final 
contents  from  the  reconstruction  description  gives  us  M’s  storage  tape  contents  at  time 
.  Notice  that  although  there  are  many  unknown  if-  ’s,  they  are  never  polled  since  A ) 
skips  over  them  because  A2  never  goes  into  R  . 

Remark  2.5:  If  M  is  nondeterministic,  then  we  need  to  rephrase  “contents  of 
storage  tape”  by  “legal  contents  of  storage  tape”,  which  simply  means  that  some  compu¬ 
tation  path  for  the  same  input  would  create  this  storage  tape  contents. 


3.  Lower  bound  for  simulating  one  queue  by  one  tape 


We  present  a  tight  lower  time  bound  for  deterministic  simulation  of  one  queue  by 
one  off-line  tape  with  one-way  input.  (For  a  lower  bound  for  the  nondeterministic  case, 
see  ILVj  or  [Li3j.) 

Remark.  Only  in  this  section  3,  g(n)£Q(f  (n))  means  “there  is  a  positive  con¬ 
stant  6  such  that  g{n)~>6f  ( n )  infinitely  often”.  F.verywhere  else  the  results  hold  for 
the  stronger  variant  of  Q:  “there  exist  a  positive  constant  b  and  a  positive  integer  n0 
such  that  g  (n  )>/>/  (n  )  for  all  n  >  n0”. 

Let  <prejiX  mean  ‘is  a  prefix  of.’  Let  S  =  {0,l}x{0, 1.5,1, e } ,  where  r  denotes  the 
empty  string,  and  consider  the  words  over  E  of  the  form 


(fli.6,)(o2.6,)  ■  •  •  (a„  ,b„) 

such  that  for  all  i ,  , 

ai  €{0,1 }  and  6j€{ 0, 1 ,0,T,c } 

^1^2  '  bi  <prefiz  '  '  “i  • 

where  for  any  pair  (<t  ,b  )£E  we  define  h  by 


(3.1) 


31 
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a  =  a  if  6  =  f 
a  =  a  if  b  ^  e  . 

Remark.  Words  of  this  form  constitute  the  witness  language  Lq  below,  which  is 
real-time  acceptable  by  a  queue  but  which  requires  Q(n2)  time  for  acceptance  by  a  tape. 
Think  of  the  sequence  ajOj  •  •  ■  aB  as  the  n-length  sequence  of  bits  to  be  stored  con¬ 
secutively  in  the  queue,  and  the  sequence  fttft 2-  •  ■  bn ,  of  length  m  (m<n),  as  the 
sequence  of  bits  which  are  consecutively  unstored  from  the  queue.  (Note,  that  while 
fli  for  all  1 ,  it  is  possible  that  ft,-  =e  for  some  i  (l<*  <n ).  That  is,  (a,-  ,ft, )  specifies 
that  e  be  unstored.)  For  technical  reasons  in  the  proof  below,  we  have  to  complicate  this 
scheme.  All  of  the  prefix  of  a^a2  •  ■  o„  which  has  been  stored  in  the  queue  previously, 
needs  to  remain  stored  in  the  queue  forever.  Nonetheless,  to  force  the  queue  to  operate 
correctly  we  need  to  be  able  to  unstore  it.  To  combine  both  requirements,  each  pair 
(  a,  , ft, )  causes  the  queue  not  only  to  store  and  to  unstore  ft,  (possibly  e),  but  also  to 
store  6,  anew.  Below  we  show  that  the  scheme  of  barred  and  unbarred  a,-  ’s,  related  to 
whether  or  not  the  associated  ‘unstore’  ft,-  ’s  are  e  or  not,  makes  it  possible  to  retrieve  the 
complete  sequence  of  a,- ’s,  in  the  order  they  have  been  stored  originally,  from  the  queue 
contents  at  each  instant. 

The  witness  language  Lq  consists  of  all  words  satisfying  (3wl).  To  aid  intuition,  we 
can  view  Lq  as  the  language  accepted  by  a  queue  Q  as  follows: 

•  Initially,  Q  is  empty. 

•  For  all  «>1,  input  command  ‘(a,,ft,-)’  to  Q  is  interpreted  by  Q  as  ‘if  6,  —e  then 

append  a,-  to  the  rear  else  append  3*  to  the  rear;  delete  ft,  up  front;  append  ft,  to 

the  rear.’  (Here  ‘ action  1  ;action2;actk>n3’  denotes  the  sequential  execution  of  actionl, 
action2  and  action3.) 

•  A  word  (a^ftj)  •  •  •  (an,ftn)  is  accepted  if  the  sequence  of  successive  front  items 

deleted  in  the  actual  computation  by  Q  on  thir  input  is  the  sequence  ft  x,  .  .  .  ,  ftn  . 

All  other  words  are  rejected. 

The  properties  of  words  of  form  (3.1)  we  need  in  the  sequel  are  expressed  in  the  following 
three  lemmas. 

Lemma  3.1:  For  a  word  of  the  form  (8.1),  \  albla2b2  ■  •  a,-  ft,-  | 

-  |  ft  i  ft  2  '  '  ‘  ft<  |  —  *  for  all  i ,  1  <  i  <  n  . 

Proof:  Obvious. 

Lemma  3.2:  For  a  word  of  the  form  (3.1)  we  can  reconstruct  ala2  •  •  •  an  from  the 
n  -length  suffix  of  a  lbld2bi  •  an  ft„  . 

Proof:  Let  the  n-length  suffix  be  X\X<i  xn  with  *,€{0,1,0,!}  (l<t <n).  By 
(3.1)  one  of  the  following  two  cases  must  hold  (note  that  the  combination  *„_j€{0,1} 
and  x„  6(0.1}  is  impossible): 

(a)  Assume  x„  ,*„ -i€{0,l}.  Then  aH—xn  and  ft„  =e  by  (3.1).  Consequently, 
*1*2  xn  _j  is  the  (n  -l)-length  suffix  of  a1bla2b2  •  an-i^n-i  by  (3.1). 


(b)  Assume  £„_!€{(), 1}-  Then  an—xn_l  and  bn—xn  by  (3.1).  Consequently, 
xnxxx.2  ■  ■  ■  xn_2  is  the  (n-l)-length  suffix  of  d1bld2b2  '  '  “n-i^n-i  bv  (3.1). 
(Because  6„  is  the  last  unstored  symbol  which  has  been  appended  to  the  rear  of  the 
queue,  it  is  the  last  symbol  to  have  been  unstored  from  the  front  of  the  queue. 
Therefore,  to  restore  the  queue  contents  just  before  ( a„,bn )  is  processed,  we  delete 
suffix  an  bn  from  x  jio  ■  •  •  xn  and  prefix  the  remaining  string  with  bn  .) 

Iterating  this  reasoning  n  times  we  recover  all  of  a  ja2  •  •  •  an  .  This  proves  the  lemma. 

Lemma  3.3:  For  a  word  of  the  form  (8.1)  with  |  6 1  •  -  •  6„  |  =  m  ,  we  can  recon¬ 
struct  ala2  ■  ■  ■  am  i2  from  bx  ■  •  ■  bn  . 

Proof:  Let 

b  \  b  2  ■  ■  b„  =  xxx2  ■  ■  •  xm>  **€{0,1, 0,1)  (l<i  <m). 

By  (3.1)  we  have  dt  =  xv 

(a)  If  £iG{0,1}  then  a^ij  and  6  i=e .  Consequently,  x2  •  •  •  xm  is  the  (m-l)-length 
prefix  of  a262  •  •  ■  a„  b„  by  (3.1). 

(b)  If  *i€{0,l}  then  a1=i1  and  bi=x2.  Consequently,  x3  ■  ■  ■  xm  is  the  (m-2)-length 
prefix  of  a2b2  •  •  •  a„  6„  by  (3.1). 

Iterating  this  reasoning  m/ 2  times  we  recover  all  of  ai<i2  ■  •  ■  am/2.  This  proves  the 
lemma. 

Theorem  3.2.  It  requires  Q(n2)  time  to  deterministically  simulate  one  queue  by 
one  off-line  tape  with  one-way  input. 

Proof.  (I).  Assume,  by  way  of  contradiction,  that  an  off-line  deterministic  1-tape 
machine  M  with  one-way  input  accepts  Lq  in  time  T(n  )(£fl(n2).  We  derive  a  contradic¬ 
tion  by  showing  that  then  some  incompressible  string  has  too  short  a  description. 
Without  loss  of  generality,  it  can  be  assumed  that  M  has  a  semi-infinite  storage  tape 
;0,oo)  on  which  it  writes  only  0’s  and  l’s,  The  positions  at  time  t  are  denoted  by  h.(<) 
and  h2(t).  By  t,-  we  denote  the  time  when  the  i  th  input  command  is  polled,  i.e., 
h  i(f;  )=i  and  h  l(ti-l)=i -l.  Fix  a  constant  C  and  the  word  length  n  as  large  as 
needed  to  derive  the  desired  contradictions  below  and  such  that  the  formulas  in  the 
sequel  are  meaningful.  Below  we  show  that  T(m)>m2/C 4,  for  some  m. 
vn  ,'C  <m  <n  ,  which  contradicts  the  assumption  and  proves  the  theorem. 

First,  choose  an  incompressible  string  x  6(0,1}  *  of  length  n.  We  consider  the 
behavior  of  M  on  a  fixed  input  prefix.  This  can  be  any  string  z  such  that 
x  =1^2  ■  ■  ■  xn  ,  y=y  iy2  •  •  •  y„  and  z  ={x  hy  x){x 2,y t)  •  •  ■  ( xn  ,yn  ),  for  some  y  such 
that  z  satisfies  (3.1).  Therefore,  zELq .  If  many  polls  occur  while  the  head  h 2  is  in  some 
small  area,  then  we  can  show  that  x  is  not  incompressible  (Case  1).  Otherwise,  we 
choose  particular  y, ’s,  among  the  possibilities  which  remain  under  this  constraint,  so  as 
to  suit  the  argument  in  Case  2  below. 

Case  l  (Jammed).  Fix  an  integer  m  such  that  \fn  /C  <m  <n  (any  such  integer 
in  will  do)  and  consider  the  m -length  prefix  z{m)  of  z.  By  (3.1),  if  z  is  in  Lq  then  so 


is  each  prefix  of  z ,  so  in  particular  z(m  )€Lq  .  Assume,  by  way  of  contradiction,  that  in 
the  accepting  computation  on  z(m)  at  least  2m  /C  polls  occur,  with  h2  on  a  particular 
(m /C)-length  tape  segment  R  =  [a  ,  a+m/C).  Consider  the  two  tape  segments  Rt 
and  Rr  of  length  |  R  |  /A  left  and  right  of  R  .  Choose  positions  Pi  in  /?/  and  pr  in  Rr 
with  the  shortest  c.s.’s  in  their  respective  tape  segments.  These  c.s.’s  must  both  be 
shorter  than  m/C2,  for  if  the  shortest  c.s.  in  either  tape  segment  is  longer  than  m  / C 2 
then  M  uses  T (m  )>m2/4C 3  time,  which  is  a  contradiction.  (If  0<a  <m  /AC  then  set 
|  Rt  |  =a  ,  so  that  R[RRr  C[0,oo).  Choose  pt  =  0  and  note  that  the  length  of  the  associ¬ 
ated  c.s.  can  be  set  to  0.)  We  show  that  a  short  program  can  be  constructed  which 
accepts  only  x  .  Let  the  bits  of  xx  ■  ■  ■  xm  polled  with  h 2  outside  tape  segment  (p/ ,  pr], 
concatenated  in  the  order  in  which  they  occur  in  x ,  form  a  string  u  . 

As  explained  below,  we  can  construct  a  program  to  check  if  a  string  x1  £{0,1}* 
equals  x  ,  using  as  a  description  the  values  of  n  ,  m  ,  C ,  a  ,  the  locations  of  pt  and  pr  , 
the  two  c.s.’s  at  pt  and  pr ,  the  self-delimiting  version  of  tt ,  the  bits  xm+i  ■  ■  ■  x„  ,  the 
final  contents  of  [pt ,  pr\  at  time  tm+ 1(  the  state  of  M  at  time  tm+1  and  A  2(*Tn  -^i)- 

This  description  of  x  requires  no  more  than  n  — bits,  for  sufficiently  large  C 

AC 

and  n.  However,  this  contradicts  the  incompressibility  of  x  since  K(x)>n  and 
m  >  \fn  /  C  . 


To 


check 
.  / 


whether  a  string  x1  equals  x ,  check  |  x '  |  —n  and 
x'  m  +  i  •  •  •  x'  „  =xm+l  ■  ■  ■  x„  .  By  the  Jamming  Lemma  (using  the  above  information 
as  related  to  M’s  processing  of  the  input)  reconstruct  the  contents  of  M’s  storage  tape 
at  time  /m+1,  after  processing  z  (m  )=(ar  i,y  i)  •  •  •  (*m,ym).  Simulate  M  from  time  tm+1 
onwards  on  an  input  suffix 

(0.!/m+i)(0,ym+2)  • '  •  (0,y2m)  (3.2) 

with  I  J/m+iVm+2  '  '  '  y 2m  |  =m ,  and  such  that  M  accepts  for  the  chosen  y,  ’s 
( m  +l<t  <2m ).  It  is  easy  to  see  from  (3.1),  that  there  is  such  a  suffix  (3.2)  for  which 
M  accepts  if  a:'  lx'  2  ' 

Lemma  3.1,  ym+lym+2 


x'  m  =ZjZ2  •  •  ■  xm  .  In  that  case  x 1  —x ,  and  by  (3.1)  and 
'  Vim  equals  the  m-length  suffix  of  xxyx  ■■■  xmym  .  By 


Lemma  3.2,  we  can  retrieve  x1x2 
such  that. 


xm  from  this  suffix.  Suppose,  there  is  a  x'  ^  x 


z'  [rn)  =  (x1  hy’  j)(z 1  2 ,y'  2)  ■  ■  '  {*'  m,y'  m)  (3.3) 

matches  the  description  above,  and  z'  (m )  drives  M  into  the  same  configuration  at 
time  t'  m+i  of  M’s  (m-fl)th  poll  in  its  computation,  as  the  configuration  into  which 
z(m  )  drives  M  at  time  tm+i-  Consequently,  the  concatenation  of  (3.3)  and  (3.2)  is  also 
accepted  by  M.  Note,  that  x'  differs  from  x  only  in  the  first  m  bits,  in  particular  in 
those  bits  polled  with  h2  positioned  in  tape  segment  \pi,pr\.  We  can  cut  and  paste  the 
computations  based  on  z'  (m)  inside  (p{,pr].and  based  on  z(m)  outside  [p/,pr],  and 
still  have  M  accept.  The  ‘cut  and  paste’  computation  is  accepting  up  to  the  (m+l)th 
poll  because  both  computations  satisfy  the  description  above,  and  afterwards  because 
the  two  computations  are  identical  from  the  (m+l)st  poll  onwards.  Let  the  resulting 
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string  composed  in  the  obvious  way  from  / ,  •  xm  and  x'  x  ■  ■  ■  s'  be 

\(m)~\1  •  •  ■  y„,  with  .x '  ,}  (l<»<m).  Above  we  saw  that  we  can  retrieve 

.r  jj o  •  •  ■  xm  from  ym+1  •  •  •  y2m  ,  by  Lemma  3.1  and  Lemma  3.2.  However,  tin-  con¬ 
tradicts  the  acceptance  by  M  of  the  cut  and  paste  computation  based  on  z(m)  and 
z'  (m  ),  because  that  entails  the  retrieval  of  \(ni  xxx2  ■  ■  •  xm  from  ym±\  •  •  //•>,„ 

by  (3.1),  Lemma  3.1  and  Lemma  3.2. 

Case  2  (Not  jammed).  Let  n'  be  any  integer  such  that  \fn  /C  <n'  <n  .  Let 
:(n'  )  be  the  n'  -length  prefix  of  an  n-length  input  z.  By  (3.1),  if  z&Lq  then 

z(n'  )CLq.  Assume,  by  way  of  contradiction,  that  in  the  accepting  computation  of  M 
on  z(n'  )  at  most  2 n'  /C  polls  occur  with  ho  on  any  particular  ( n 1  /C')-leng:h  tape 
segm en t  R  =  [a  ,  a  +  n  '  / C  ). 

We  now  define  the  particular  input  z  we  need.  Let  x=xxx2  ■  ■  ■  xn  be  as  in  Case  1. 
Determine  the  y,  ’s  ( 1  <  i  <  n  )  in  input  z  ~  (x  ■  ■  ■  (x„  ,y„  )  as  follows. 

(1)  Let  M  start  its  computation  with  yx=(.  So  first  (xi,2/i)  is  polled. 

(2)  Let  M  continue  its  computation  and  suppose  we  have  determined 

(xiil/i)  '  '  '  (xi-\Ni-\)  and  M  polls  for  the  *  th  time.  Let  £,■  be  the  time  at  which 

M  polls  (xj  ,y, ).  If  h  2(t,-  )6[0,n  /4)  then  y,- —  e,  else  y,-  ^  e.  In  the  latter  case  y{  is 

be  determined  uniquely  from  (x  i,y  i)  •  •  •  (*»_i,y»_i)  by  using  the  relation 
Uiy-2  •  •  •  y,  <  prefix  x\y\  ■  •  •  z,-iy,-i.  that  is,  using  (3.1)  and  the  fact  that  yx=e 
by  (1). 

We  now  fix  a  particular  value  m  as  determined  by  M’s  computation  on  5  . 

(a)  Fly  contradictory  assumption  (with  n'  —n  ),  we  have  that  <n/2  polls  occur  on 
Q.ii  /A)  and  >n  /2  polls  occur  on  in  /2,oo). 

(b)  Since  T  (n  )<£fi(n2),  we  have  /i2(<»'  )EiO,n  /  4),  for  all  i  (l<i  <\AT  ). 

Let  l(t  )  and  r  (t )  be  the  number  of  polls  for  (x,,y,)’ s.  with  h2(ti  )€|0,?i  /A)  and 

h  2(  L  )£'  n  /  Loo)  (1  <ti  <t  ),  respectively.  By  (a)  and  (b)  there  is  an  integer  m  such  that 
l(t)>r(t),  for  l<<<tm,  l(tm)  —  r(tm)  and  \fn  /  C  <m  <n  .  This  m  is  the  break 
even  length  where  the  number  of  polls  left  end  right  of  position  n  /A  on  the  tape  is  equal 
for  the  first  time. 

Claim  I.  As  a  consequence  of  this  definition  of  m  and  (1)  and  (2),  it  follows  that 
r  (I  m  )  |  y,  •  •  •  ym  |  =w  /2  for  input  prefix 

~(m  '  =  f.r  j,y ,)  •  •  •  (xm  ,ym  )• 

oince  each  prefix  of  c  satisfies  (3.1),  we  can  retrieve  x  ]  ■  •  •  xm  /4  from  prefix  yx  ■  ■  ■  ym 
<>f  xxy  t  ■  ■  ,rm  ym  by  Lemma  3.3. 

Claim  2.  By  definition,  all  y,- ’s  in  y,  •  ym  ,  which  are  different  from  c.  are  polled 

on  n  Loo).  Since  l  (/„,  /4)  >  r  (tm  /.,),  at  most  m  '8  of  the  x, ’s  in  x  j  ■  •  •  x,„  /4  are  polled 

on  n  l,oo). 

lu  the  computation  on  the  m-length  prefix  z(m  )  of  c  ,  choose  the  point  p  with  the 
-hortest  c  s.  in  \n/A-  m/C ,  n/4).  This  c.s.  is  shorter  than  m/C2;  otherwise,  the 


«r .  y, 


running  time  T(m)>m2/C3,  which  is  a  contradiction. 

As  explained  below,  we  can  construct  a  program  to  check  if  a  string  x'  E{0,1}  * 
equals  x ,  using  as  a  description  the  values  of  n ,  m  ,  the  position  of  p  ,  the  c.s.  at  p  ,  the 
string  u  of  concatenated  bits  of  Zj  •  ■  zm/4,  polled  with  h2  on  [p  >  °°)  and  the  string 

1  (m  /4)  +1  '  ‘  *n  ■ 

Tfl 

This  description  of  x  requires  no  more  than  n - bits,  for  sufficiently  large  C 

16 

and  n  .  However,  this  contradicts  the  incompressibility  of  x  since  K(x)>n  and 
m  > \fn  /  C  . 

To  check  whether  a  string  x'  equals  z,  check  |  x'  \  —n  and 
x'  •  •  •  x'  n—  i(m/4)+i  •  ■  ■  x„  .  Let  u'  be  the  result  of  deleting  the  bits  in 

x 1  in  the  same  positions  as  the  ones  used  to  obtain  u  from  x .  These  positions  are 
determined  by  the  crossing  sequence  at  p.  Check  u'  =u .  If  the  test  is  negative  then 
x'  7^  x ,  else  x'  can  only  differ  from  x  on  positions  where  z4  •  •  •  zm/4’s  bits  are  polled 
with  h2  on  [0,  p ).  Run  M  on  z'  (m ),  that  is,  the  input  constructed  according  to  (1), 
(2),  using  the  m -length  prefix  x'  xx’  2  •  •  ■  x'  m  of  a  candidate  x'  .  Whenever  h2 
crosses  p  we  interrupt  M  and  check  if  the  current  ID  in  the  computation  is  consistent 
with  the  corresponding  ID  in  the  c.s.  at  p  . 

By  construction  everything  matches  up  to  the  end  of  processing  input  z'  (m),  and 
M  accepts,  if  x'  —x.  Assume  that  x'  x  matches  the  description  as  well.  There¬ 
fore,  x'  jz'  2  •  •  •  x'  m/4  ^  *1*2  '  ’  ‘  xm/4  a“d  x>  »—*,-  for  all  »  (m/4 +l<t  <n ). 
Let  the  input  z1  (m),  based  on  x'  1xl  2  •  •  •  z1  m  and  constructed  according  to  (l),(2), 

be 

z'  {m)  =  {x'  lty'  j)(z 1  2,y'  2)  •  •  •  (*'  m,y'  m)  • 

Let  the  input  based  on  xlx2  ■  ■  ■  xm  ,  constructed  according  to  (1),  (2),  be 

z(m)  =  {x  j.yjX*  2.  y*)  ■  ■  ■  (*m  >Vm )  • 

By  assumption,  z'  and  z  differ  only  on  the  first  m  /4  bits,  and  then  only  on  the  bits 
that  are  polled  left  of  p  .  Let  the  final  accepting  position  of  h2  for  A/’s  computation  on 
z  (m  )  be  right  of  p  .  (If  it  is  left  of  p  interchange  z  and  z'  below.)  Cut  and  paste  the 
computations  on  z{m)  and  z'  (m )  such  that  M  runs  on  input  z'  (m)  with  h2  left  of 

position  p,  and  M  runs  on  input  z(m)  with  h2  right  of  position  p.  Let  $(m)  be  the 

input  composed  in  this  way  from  z'  (m)  and  z(m).  By  construction,  the  computation 
on  )  is  also  an  accepting  computation  of  M.  Consequently,  <{m  )  satisfies  (3.1). 
Then, 

$(m  )  =  (a1,6,)(a2,62)  •  •  (am  ,6m) 

with  (a,  ,6, )  is  either  (zf  ,y,- )  or  (x1  ,  ,y '  i)(l<*  <m).  Because  both  z(m  )  and  z'  (m) 
match  the  description,  (x1  ,-,y'  ,  )  is  polled  right  of  position  p  if  and  only  if  (z,  ,y, )  is 
polled  right  of  position  p  for  all  i  ,  l<i  <m.  Therefore,  y,-  =e  if  and  only  if  y'  ,■  —  <  if 

and  only  if  (ar,-  ,y,- )  is  polled  left  of  position  p  if  and  only  if  (x1  ,  ,y'  ,  )  is  polled  left  of 

position  p  for  all  i,  l<i<m.  Consequently,  the  sequences  of  'unstored’  symbols 


(3.4) 


unequal  e  in  the  computations  on  2(7/1),  z'  (m)  and  $(m)  are  equal,  that  is, 

6160  '  '  '  f>m  =  yxy 2  ■  ■  ■  ym 

By  assumption,  (x{  ,t/, )  and  (x1  ,• ,»/,  )  are  polled  left  of  p  and  z,  7^  zt- '  for  some  1, 
1  <  i  <7/1  /4.  Therefore,  since  a,  =z '  ,•  if  the  1  th  poll  occurs  left  of  p  ,  and  a,-  =zt-  if  the 
/  th  poll  occurs  right  of  p  (l<i<m/4),  in  M’s  computation  on  <^m  ),  we  have 
a  xa2  •  ■  •  am  /4  7^  z  1Z0  •  •  ■  zm  /4.  Because  ztz2  ■  •  ■  zm/4  is  retrieved  from  y\-ym  by 
Claim  1,  we  retrieve  ZjZo  ■  ■  •  zm/4  from  6  2  •  •  ■  bm  as  well,  by  (3.1),  using  Lemma 

3.3.  However,  for  <^m)  to  satisfy  (3.1),  we  have  to  retrieve  •  •  •  am//4  from 

6160  ■  •  •  bm ,  by  virtue  of  the  construction  of  <{m)  and  Claim  1.  Consequently, 
616o  •  •  •  6m  is  not  a  prefix  of  albla2b2-  •  -  am  bm  as  required  by  (3.1).  Hence,  <^m) 
does  not  satisfy  (3.1),  which  is  a  contradiction. 

Since  m  >  \fn  / C ,  Cases  1  and  2  complete  the  proof  of  r(n)GO(n2). 

(II).  With  the  description  of  Lq  we  have  already  indicated  how  a  queue  recognizes 
this  language  in  real-time. 

The  theorem  follows  from  (I)  and  (II). 

4.  Lower  bound  for  simulating  one  pushdown  by  one  queue. 

In  this  section,  we  show  that  it  takes  at  least  fi(n4/3/logri )  time  for  a  one-way 
input  one  queue  nondeterministic  machine  to  recognize  the  language 
{«■#><>*  :wE{0,l}'}. 

Because  this  language  can  be  recognized  in  linear  time  by  a  deterministic  pushdown 
automaton,  we  can  conclude  that  it  takes  at  least  fi(n4/3/logn  )  time  for  a  one  queue 
nondeterministic  machine  to  simulate  a  deterministic  pushdown  automaton. 

The  intuition  behind  the  proof  is  that  while  the  queue  machine  reads  w ,  it  has  to 
store  all  the  information  in  some  sequential  way  on  the  queue.  To  compare  this  informa¬ 
tion  with  wR  ,  the  machine  will  have  to  go  through  the  queue  too  many  times. 

Let  hr  be  the  read-only  head  on  the  one-way  input  tape.  We  can  view  the  queue  as 
a  tape  with  two  heads  h2  and  /j3.  The  head  h2  is  a  read-only,  one-way  head  on  the 
queue.  The  head  h3  is  a  write-only,  one-way  head  on  the  queue.  Each  time  something  is 
put  on  the  queue,  /i3  writes  and  each  time  something  is  read  from  the  queue,  h2  reads. 

Theorem  4.1:  A  one-way  input  one  queue  nondeterministic  machine  takes  time  in 
Lt(n 4  3 'log/;  )  to  accept  the  language  {w#wR  :u>6{0,1}*  }. 

Proof:  Leading  to  a  contradiction,  we  suppose  that  there  is  an  algorithm  to  accept 
/.  in  time  T(n  )  which  is  not  in  fl(n4/,3/logn ). 

Let  /»,  (< )  denote  the  position  of  head  i  at  time  t  on  its  respective  tape.  At  time  t , 
the  length  of  the  queue  is  hz{t  )-h  2(t ),  and  the  content  of  the  tape  between  /i2(<)  and 
//3(/ )  is  called  the  actual  queue  . 

Let  z  be  an  incompressible  string.  We  separate  z  into  blocks:  x~  xQxlx2  •  •  zm  . 

Each  block  z,  for  l<i<m  is  separated  into  p  subblocks:  z,-  =z, tXi2xi3  ■  ■  ■  xip  .  For 
the  proof  of  the  theorem,  we  take  m—n */3  and  p  —n2^/k  jlogn ,  where  kx  is  an 


appropriately  chosen  constant.  Let  |  *  |  —n  and  |z0|=n/2.  Each  subblock  will 
have  the  same  length,  c  logo .  We  look  at  any  computation  of  the  machine  on  input 
x#xR . 

Claim  4.2:  If  n  /2<hx(t)<3n  /2,  then  the  length  of  the  queue  at  time  t  is  at  least 
n  /  2-log  n  . 

Proof:  We  know  that  K (x0)>n  /2-k2  for  some  constant  k2 .  The  result  follows 
because  x0  can  be  described  by  the  content  of  the  queue,  the  state  of  the  machine  and 

MO- 

Claim  4.3:  Let  tj  be  the  time  step  when  the  input  head  enters  the  block  x, .  For 
at  least  half  of  the  blocks  Xj  ,  we  have  that  Mf>'+i)<Mf/)- 

Proof:  Otherwise  the  algorithm  takes  time  fl(n4^8/logn ). 

The  machine  needs  to  remember  what  it  reads  on  the  input  and  code  it  in  some 
way  on  the  queue.  This  notion  will  be  captured  by  the  influence  relation  defined  below. 
What  is  written  on  the  queue  can  be  a  coding  of  the  input  and  of  the  rear  of  the  queue. 
If  k2{tj+1)<h3(tj),  then  we  have  the  nice  property  that  a  whole  block  from  the  input 
has  to  be  coded  sequentially  on  the  queue,  since  the  reading  head  on  the  queue  doesn’t 
reach  where  the  coding  has  started.  Let’s  call  the  blocks  which  satisfy  this  last  claim  the 
valid  blocks. 

Now,  we  define  the  influence  relation.  Let  cx,e2,  •  ■  •  c,  be  the  cells  on  the  input 
tape.  Let  dx,d2,  •  ■  •  d,  be  the  cells  on  the  queue.  We  say  that  a  cell  dj  is  directly 
influenced  by  a  cell  c,-  if  hx(t  )=i  at  the  time  t  when  hz  writes  on  d}- .  Similarly,  a  cell 
dj  is  directly  influenced  by  a  cell  d,  if  h^t)—i  at  the  time  t  when  h3  writes  on  dj  . 

The  influence  relation  is  the  transitive  closure  of  the  direct  influence  relation.  We 
say  that  c,-  (or  d{ )  influences  dj  if  there  is  a  chain  of  direct  influences  from  c*  to  dj .  A 
block  of  cells  influences  a  cell  if  and  only  if  at  least  one  of  the  cells  in  the  block 
influences  it. 

The  influence  relation  will  allow  us  to  talk  about  where  the  information  can  be 
stored  on  the  queue.  Notice  that  during  the  computation,  the  content  of  a  cell  may  still 
be  dependent  on  some  other  input  cell  even  if  that  input  cell  has  no  influence  on  it,  due 
to  the  finite  control  of  the  machine.  This  minor  problem  will  not  cause  any  trouble. 

Claim  4.4:  For  any  block  Xj  such  that  M*/+i)<Mf/)t  we  have  that  each  cell  in 
Xj  influences  a  disjoint  set  of  cells  on  the  queue.  Moreover,  the  regions  influenced  by 
these  cells  form  an  ordered  sequence  of  regions  on  the  actual  queue  at  any  later  time. 

Now  we  look  at  what  happens  when  the  input  head  h  x  reads  the  second  part  of  the 
input.  Let  tj1  denote  the  time  when  the  head  hx  enters  the  block  XjR  corresponding 

to  Xj  . 

Claim  4.5:  There  is  at  least  one  valid  blpck  Xj  such  that  h2(tj_ x  )<h3(tj'  ). 
(Remember  that  Xj_x  R  follows  xj1  R  .) 
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Proof:  Otherwise  the  algorithm  takes  time  n(n4/3/logn ). 

In  the  following  two  claims,  we  mention  cycles  and  crossing  sequences.  A  cycle  is 
any  span  of  time  from  time  t  to  time  t  '  such  that  h*(t  '  The  crossing 

sequence  associated  with  the  border  between  cell  d,  and  cell  di+1  is  the  list  of  states  of 
the  machine  when  any  head  goes  from  cell  d,  to  cell  di+1.  Eecause  the  tape  is  in  fact  a 
queue,  the  crossing  sequence  will  have  length  2. 

Each  block  influences  a  series  of  regions  on  the  tape,  one  for  each  cycle  of  the 
queue.  The  crossing  sequence  around  a  list  of  regions  is  the  concatenation  of  the  cross¬ 
ing  sequences  under  the  border  of  each  region. 

Claim  4.6:  Throughout  r  cycles,  starting  at  time  t0'  ,  the  actual  queue  always  has 
length  at  least  n2^3-(r +A:4)logn  ,  for  some  constant  k4. 

Proof.  Let  x,  be  the  block  provided  by  claim  4.5.  Let  J  be  the  string  x  where 
x,  has  been  deleted.  Because  the  regions  influenced  by  the  are  ordered  on  the  actual 
queue,  and  the  regions  influenced  by  the  x,;-  R  are  in  reverse  order,  there  is  only  one  con¬ 
tiguous  region  which  can  be  influenced  by  both  a  subblock  x,-7  and  its  corresponding 
subblock  xijR  .  We  call  this  region  the  overlapping  region. 

At  any  time  after  t0'  ,  the  string  x  can  be  totally  described  by  J  ,  the  index  of 
x, ,  the  actual  queue,  the  crossing  sequences  around  the  overlapping  region  and  the  con¬ 
tent  of  the  regions  that  were  overlapping  at  each  cycle. 

Claim  4.7:  The  machine  makes  fl( n 2/3/logn  )  cycles  after  t0'  . 

Proof:  The  string  x  cm  be  described  by  J  ,  the  index  of  x,- ,  the  crossing  sequence 
around  the  overlapping  region  and  the  content  of  the  regions  that  were  overlapping  at 
each  cycle.  At  each  cycle,  this  information  is  of  length  O(logn),  so  it  takes  n2/3/logn 
cycles  to  gather  enough  information.  (At  the  end,  we  don’t  need  the  actual  queue,  so  r 
has  to  be  large  to  compensate.) 

By  the  last  two  claims,  the  machine  takes  time  in  ^(n^/logn ). 

5.  Simulating  more  queues  by  less  queues 

In  this  section  we  study  the  power  of  queue  machines  with  different  number  of 
queues.  We  first  show  that  2  queues  are  as  good  as  k  queues  in  the  nondeterministic 
case.  This  motivates  our  research  concerning  a  small  number  of  queues.  We  also  show 
that  1  queue  can  simulate  k  queues  in  quadratic  time,  deterministically  or  nondetermin- 
istically.  We  will  provide  tight,  and  almost  tight,  lower  bounds  for  our  simulations  men¬ 
tioned  above. 

5.1.  Upper  bounds 

Theorem  5.1:  Two  pushdown  stores  can  simulate  one  queue  in  linear  time,  both 
for  deterministic  and  nondeterministic  machines. 

Proof:  Let  P  be  a  two  pushdown  store  machine  with  2  pushdown  stores  pd  1,  pd  2. 
To  simulate  a  queue,  every  time  the  a  symbol  is  pushed  into  the  queue,  P  pushes  the 
same  symbol  into  pd  1.  If  a  symbol  is  taken  from  the  queue,  then  P  pops  a  (the  same) 
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symbol  from  pd  2  if  pd  2  is  not  empty.  If  pd2  is  empty  then  P  first  unloads  the  entire 
contents  of  pd  1  into  pd  2  and  then  pops  the  top  symbol  from  pd  2.  At  the  end  of  the 
input,  P  accepts  iff  the  1  queue  machine  accepts. 

Theorem  5.2:  Two  queues  can  nondeterministically  simulate  k  queues  for  any 
fixed  k  in  linear  time. 

Proof:  This  is  actually  the  same  technique  Book  and  Greibach  [BG]  used  to  prove 
the  same  theorem  for  tapes.  The  2  queue  machine  guesses  the  computation  of  the  k- 
queue  machine  computation  and  put  this  guess  on  1  queue.  Then  use  the  other  queue  to 
simulate  the  computation  of  each  of  the  k  queues  of  the  simulated  machine  and  check 
its  correctness  against  the  guess  on  the  first  queue.  We  refer  the  reader  to  [BG]  for  the 
details  (This  simulation  takes  0  ( kn )  time.  But  it  can  be  improved  to  real  time  using 
the  methods  developed  in  [BG].) 

Theorem  5.3:  3  pushdown  stores  can  nondeterministically  simulate  k  queues  in 
linear  time. 

Proof:  Combine  ideas  from  the  above  2  theorems.  I.e.  ,  guess  the  computation  of 
the  k  queue  machine,  and  put  the  guess  into  one  pushdown  store.  Save  this  guess  also  to 
another  pushdown  store  (but  put  a  marker  on  the  top).  Then  simulate  a  queue  and 
check  the  correctness  of  the  guess.  (The  simulation  needs  2  pushdowns,  one  of  the  push¬ 
downs  has  the  guessed  computation  saved  on  the  bottom.)  After  simulating  one  queue, 
retrieve  the  guessed  content  and  again  put  it  into  2  pushdowns.  Repeat  this  process  for 
each  queue. 

Notice  that  a  strange  phenomenon  occurs  here.  When  we  have  1  queue  and  1  push¬ 
down  store,  1  queue  is  better  in  the  sense  that  1  queue  can  accept  all  the  r.e.  languages 
but  1  pushdown  cannot.  However,  when  we  have  more  pushdown  stores,  more  pushdown 
stores  seems  to  be  better  than  queues  because  they  are  more  efficient. 

Theorem  5.4:  One  queue  can  simulate  k  queues  in  quadratic  time,  both  determin¬ 
istically  and  nondeterministically. 

Proof:  Here,  use  some  basic  simulation  schemes. 

This  also  relates  to  a  interesting  problem  of  whether  ”2  heads  (on  one  tape)  are 
better  than  2  tapes  (each  with  one  single  head)”.  Vitafiyi  [Vi3]  showed  that  2  tapes  can¬ 
not  simulate  a  queue  in  real  time  if  one  of  the  tape  has  only  o  (n )  cells  to  use.  Our  result 
here  shows  that  2  pushdowns  can  simulate  a  queue  in  linear  time.  It  would  be  interesting 
to  know  whether  this  can  be  done  in  real  time.  The  question  of  how  to  simulate  k 
deterministic  queues  by  2  queues  (like  the  Hennie  Stearns  simulation  in  the  tape  case 
[HS2])  remains  open. 

5.2.  Lower  bounds 

We  now  prove  optimal  lower  bounds  for  above  simulations.  We  define  the  language 


L  ={a  &  b  q*6  i  •  •  •  6*1# 

*oV*iV*i’*/  •  •  1 


** -1  *  (* -l)/2^*2 
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^  0^  (i  +l)/2^  \^2^(k+  3)/2^34 
n#  a 


^  2»  mod  (i +1)^»  ^  (2i  +l)mod  (t  +1) 


|  6,1  =  b2  =  6, 3  =  6f4  for  *  =  0,  .  .  .  ,  k  for  any  odd  k  } 


The  length  of  each  bj  is  a  fixed  constant  C .  The  superscripts  of  6,- ’s  are  used  only 
to  facilitate  later  discussions.  L  can  be  considered  as  a  modified  version  of  a  language 
used  in  [Ma],  We  have  added  a  string  a  on  the  both  ends.  The  purpose  of  a  is  to 
prevent  the  queue  from  shrinking  since  if  we  choose  a  to  be  a  long  random  string  then 
before  the  second  a  is  read  the  size  of  the  queue  has  to  be  larger  than  |  a  \  .  We  have  to 
prevent  the  queue  from  shrinking  because  otherwise  the  crossing  sequence  argument 
would  not  work.  In  order  to  prove  the  lower  bounds  for  queues  new  techniques,  in  addi¬ 
tion  to  those  used  in  [Ma,LV],  are  required. 

Theorem  5.5:  Simulating  two  deterministic  queues  by  one  nondeterministic  queue 
requires  ft(n2/log2n  loglogn  )  time. 

Proof:  We  will  show  that  L  defined  above  requires  fi(n2/log2n  1  oglogn  )  time  on  a 
one-queue  nondeterministic  machine  (always  with  an  extra  1-way  input  tape).  Since  L 
can  obviously  be  accepted  by  a  two-queue  deterministic  machine  in  linear  time,  the 
theorem  will  follow. 

Now,  aiming  at  a  contradiction,  assume  that  a  one-queue  machine  M  accepts  L  in 
o  (n2/\og2n  loglogn)  time.  Only  for  the  notational  convenience,  we  think  the  queue  of 
M  as  a  circular  tape  with  just  one  queue  head,  which  combines  the  push  head  and  the 
pop  head.  The  head  moves  clockwise  uniformly.  The  circular  tape  can  augment  (insert  a 
tape  square)  or  shrink  (delete  a  tape  square)  at  constant  cost  in  order  to  mimic  a  queue. 
We  call  it  Queue  and  write  |  Queue  (t)  |  to  denote  its  length  at  time  t.  Name  the 
input  head  h  l  and  the  queue  head  h2-  Initially  the  Queue  is  a  point,  a  degenerated 
ring. 

Choose  a  large  n  and  a  C  >10  |  M  |  +10  so  that  all  the  subsequent  formulas  make 
sense.  Choose  an  incompressible  string  X6{0,1}2".  Let  X=X  X  '  where 
I  X  |  =  |  X  1  j.  Equally  divide  X  '  into  k  +l=n /C  loglogn  parts, 
X  '  —x0xo  ■  ■  ■  xk ,  each  C  loglogn  long.  Consider  a  word  w£L  where  a—  X  and 
bj—Xf  for  1  <j  <5  and  0<i  <k  .  Fix  a  shortest  accepting  path,  P  ,  of  M  on  w  . 

Consider  only  the  path  P  .  Let  ts  be  the  time  when  hj  reaches  the  first  & ,  tf  9  be 
the  time  h ,  reaches  the  second  &  .  and  be  the  time  when  hi  reaches  #. 

Claivi  1.  |  Queue  (ts  )  \  >n  /2. 

Proof.  If  not,  we  can  conclude  that  K(X)<  [X  |  as  follows.  For  every  Y  such 
that,  |  Y  |  —  |  X  |  ,  let  Y =a'  y0  ■  ■  ■  yk .  Replace  the  last  a  after  the  &  sign  in  w  by 
a'  .  Using  the  (short)  description  of  the  queue,  start  to  simulate  M  from  time  t&  .  By  a 
standard  argument,  Y =X  iff  M  accepts.  Therefore  K(X)<  |X  |,  a  contradiction. 


By  a  similar  argument  as  in  Claim  1,  we  derive  Claim  2. 
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Claim  2.  |  Queue  ( t )  |  >n  /2  for  every  ts  <t  <t  s  . 

Claim  3.  The  crossing  sequence  from  time  ts  to  f  e  is  shorter  than 

_ n _ 

C  10log2ri  loglog  n 

Proof .  Follows  directly  from  Claim  2. 

If  the  Queue  cells  which  are  scanned  or  created  by  h2,  while  h  j  is  scanning  btJ,  are 
in  Q={q  x,  •  •  •  ,qu  },  then  we  say  that  V  is  mapped  into  Q .  Notice  that  b,J  is  at  first 
mapped  into  a  set,  Q ,  of  consecutive  sequence  of  cells.  But,  different  from  a  regular  Tur¬ 
ing  machine  tape,  Q  may  become  disconnected  because  other  queue  squares  can  be 
inserted  later.  We  say  that  6/  is  sequentially  mapped  if,  while  hx  scans  6/,  h2  did  not 
scan  any  Queue  cell  twice  (leave  and  re-enter),  that  is,  h2  did  not  make  a  round  trip  on 
Queue  .  We  say  that  b/  is  majorly  mapped  into  Q  if  6/  is  sequentially  mapped,  and  there 
are  two  substrings,  u  and  t; ,  of  5/  which  are  mapped  into  Q  and 

I  “  I  +  I  «  I  > — j - 1 

Remark.  According  to  above  definition,  a  6/  can  be  majorly  mapped  into  two  dis¬ 
joint  sets. 

Claim  4.  At  time  t#  ,  Queue  can  be  cut  into  two  segments,  Si  and  S2,  such  that 

(1)  S  1f~l5  2=^  and  5^52=  Queue  ; 

(2)  k /4  bf1’ s,  say  ,  •  •  ■  ,6^*  ,  are  majorly  mapped  into  S  j,  and  k  /4  6,1’ s,  say 

bjlx ,  •  ■  •  ,bjlt/4 ,  are  majorly  mapped  into  S2. 

’  '  -  ‘  »  *  ’^j}/ 4  }  — 

(3)  |S,|.  |S2|  >n/C*; 

Proof .  In  our  proof  only  properties  (1)  and  (2)  are  used,  (3)  is  stated  for  the  sake 
of  completeness.  We  will  only  give  proofs  of  (1)  and  (2).  The  proof  of  (3)  is  very  similar 
to  the  last  part  of  this  proof  and  we  leave  the  proof  of  (3)  to  the  interested  readers. 

go  k  i 

First  we  show  that  6,1  are  sequentially  mapped.  By  Claim  2,  for  ts  <t  <tf  & , 

lvv 

we  always  have  |  Queue  (f)  |  >n/2.  Therefore,  if  after  time  t8  more  than  k  / 100  6,-1’ s 
are  not  sequentially  mapped,  then  on  each  of  them  M  must  spend  at  least  n  / 2  time  to 
go  around  the  Queue  .  Altogether  M  would  be  spending  Q(n2/loglog  n  )  time,  a  contrad¬ 
iction.  Hence  at  least  99k  / 100  5,1’ s  are  sequentially  mapped. 

Now  we  can  easily  choose  two  points  p  ,q  on  Queue  to  cut  Queue  into  two  parts 
S\  and  S2  such  that  (1)  and  (2)  in  the  claim  are  true. 

>From  now  on,  we  will  always  consider  the  Queue  to  be  partitioned  as  5 j  and  S2. 
The  sizes  of  St  and  S2  may  increase  or  decrease.  If  anything  is  inserted  in  the  intersec¬ 
tion  point  of  5j  and  S2  then  it  does  not  matter  in  which  set  we  place  the  inserted 
Queue  cell. 

The  next  claim  is  a  simple  generalization  of  a  theorem  proved  by  Maass  in  jM’ 
(Theorem  3.1).  The  proof  of  this  claim  is  a  simple  reworking  of  the  proof  in  [M; 


Claim  5.  Let  S  be  a  sequence  of  numbers  from  {0,  .  .  .  ,  k- 1},  where  k—  2  for 
some  /.  Assume  that  every  number  6  €  {0,  .  .  .  ,  k-l}  is  somewhere  in  5  adjacent  to 
the  number  26  (mod  k)  and  26  (mod  k)+l.  Then  for  every  partition  of  {0,  ,  Ar  —1 } 

into  two  sets  G  and  R  such  that  5  =  GUi?  and  j  G  |  ,  |  R  |  >  k  / 4  there  are  at  least 
k  /(clog  k)  (for  some  fixed  c)  elements  of  G  that  occur  somewhere  in  S  adjacent  to  a 
number  from  R  . 


A  k  / \/log  k  upper  bound  corresponding  to  the  lower  bound  in  this  lemma  is  con¬ 
tained  in  [Li].  A  more  general,  but  weaker,  upper  bound  can  be  found  in  [Kl] .  Notice 
that  any  sequence  S  in  L  satisfies  the  requirements  in  Claim  5. 

Claim  6.  At  time  if  &  ,  the  6,-  ’s  between  #  and  the  second  &  are  mapped  into 
Queue  in  the  following  way:  either 

(1)  a  set,  5X,  of  k /c  logAr  bj ’s,  which  belong  to  {6y*  ,  •  •  •  ,6^  },  are  mapped  into  .9,; 


(2)  a  set,  S 2,  of  k /c  logAr  6,-  ’s,  which  belong  to  {6,*  ,  •  ■  •  ,6,  *  },  are  mapped  into  S2. 


Where  c  <<C  is  a  small  constant  as  used  in  Claim  5. 


Proof.  By  Claim  3  we  can  assume  that  from  time  t#  to  time  t  h2  made  less 
k 

than  — - -  round  trips  on  Queue .  Therefore  by  the  nature  of  the  queue,  only 

C  log  k 
2  k  ■ 

— £ -  bf’s  can  be  mapped  into  both  S,  and  S2.  Also  since  h2  can  alternate  between 

C‘log  k 

2k 

Sx  and  S2  less  than  — - - times,  we  complete  the  proof  by  applying  Claim  5. 

C* log  k 


Without  loss  of  generality,  assume  that  (1)  of  Claim  6  is  true. 
Claim  7.  Let  tend  be  the  time  M  ends.  Either 


(a)  There  exists  a  time  If  s<t0<tend  such  that  |  Queue  (f0)  |  < — ^ - 

C*°logn 


and  the 


crossing  sequence  from  tf  g  to  /q  *s  shorter  than 


"  C10logn  loglogn 

(b)  >From  time  tf  &  to  time  tend  the  length  of  the  crossing  sequence  is  shorter  than 


C'10logn  loglogn 


Proof .  If  (a)  and  (b)  are  both  false,  then  M  spends  f)( - ^ - - )  time,  a  con¬ 


tradiction. 


log  n  loglogn 


Bv  Claim  3  the  crossing  sequence  is  shorter  than  — - - -  before  time 

C10log2n  loglogn 

t'  fj  Record  this  crossing  sequence.  For  every  j  ,k ,  if  6*G?i,  then  6;*  is  majorly  mapped 
into  S2.  Let  u; ,  Vj  be  the  substrings  of  6;*  such  that 


(■)  I  «>  |  +  I  vj  |  >  '  ^---1  and 


(ii)  Uj  .Vj  are  mapped  into  S2. 


Let  Suv={uj,vj  |  6/6?! 


for 


some 


k  >1}. 


Notice  that 


Now  we  describe  a  program  which  reconstructs  X  with  less  than  |  X  |  informa¬ 
tion.  Consider  every  Y  such  that  |  Y  |  =  |  X  |  and  Y  =  a  y0  ■  ■  ■  yk  for  some 

yo  •  •  Vk- 

(1)  Check  if  Y  is  the  same  as  X  at  positions  other  than  those  places  occupied  by 
«/  . 

(2)  If  (1)  is  true,  then  construct  the  input,  ivy ,  as  before  except  with  *,•  replaced  by  y, 
for  i  =0,1,  ■  ■  ■  ,k . 

(3)  Run  M  following  path  P  up  to  time  t9 . 

(4)  We  distinguish  between  two  cases  according  to  Claim  7. 

Case  1.  (b)  of  Claim  7  is  true.  Record  the  crossing  sequence  from  tf  &  to  ttnd . 
Continue  to  run  M  from  tf  9  to  teni  such  that  h2  never  goes  into  S2 ■  Whenever 
h  2  reaches  the  border  of  S2  it  matches  the  current  ID  with  the  crossing  sequence. 
If  they  match  M  jumps  over  S2  and,  using  the  next  ID  on  the  other  side  of  S2  to 
start  from,  $M$  continues  until  time  ttni . 

Case  2.  (a)  of  Claim  7  is  true.  Record  the  crossing  sequence  from  time  t  s  to 
time  t0  and  the  contents  of  S2  at  time  f0.  Simulate  M,  with  h2  staying  outside  of 
S2,  from  time  (  g  to  time  t0  similar  to  Case  1.  At  time  t0,  M  puts  the  (short) 
contents  of  S2  in  the  position  of  S2  and  then  finishes  the  computation  in  the  nor¬ 
mal  way. 

(5)  By  the  end  M  accepts  iff  Y  —X .  Notice  that  since  M  is  nondeterministic,  by 
"accept”  we  mean  that  there  is  an  accepting  path. 


Now  the  information  we  used  in  this  program  is  only  the  following. 

(1)  X  -  Suv  ,  plus  the  information  to  describe  the  relative  locations  of  bj  in  X  and 
the  relative  locations  of  uj  ,vnn;-  in  j.  Using  the  coding  method  described  in 
the  previous  part  of  the  paper,  this  would  require  at  most 


m  - 


2c  logn 


C  , 

*3* c  lo«» 


I  n  X 
-<>(•; - )- 

logn 


where  the  second  term  is  for  the  uj, vt  in  Suv ,  the  third  term  is  for  the  information  to 
describe  the  relative  positions  of  bj$5 1(  and  the  last  term  is  for  the  information  needed 
to  describe  the  relative  positions  of  ut  ,vt  in  each  bt 


(2)  Description  of  the  crossing  sequence,  of  length  less  than  — j - ,  around 

C9logn  loglogn 

S2.  Again  by  the  method  used  in  previous  part  of  this  paper,  this  requires  at  most 

_ n _ 

C8logn  loglogn 


bits. 


(3)  Description  of  the  contents  of  Sx  at  time  t0  when  (a)  of  Claim  6  is  true.  But 


(4) 


Quell?  ( t0 )  |  <■ 


C  10logn 

Extra  0  (log  n )  bits  to  describe  the  program  discussed  above. 

The  total  is  less  than  j  X  |  .  Therefore  K (X)<  j  X  \  ,  a  contradiction. 

Corollary.  Simulating  two  deterministic  tapes  by  one  nondeterministic  queue 
requires  Q(n2/log2n  loglogn  ). 

Proof .  Since  L  can  also  be  accepted  by  a  two  tape  Turing  machine  in  linear  time. 


Theorem  5.6:  It  requires  fl(n2)  time  to  simulate  two  deterministic  queues  by  one 
deterministic  queue. 

Proof  Idea.  Define  a  language  L  x  as  follows.  (Below,  a  ,yt  G{0,1}*  -) 

Lx  -  {a  &  xx$x2$  ■  •  ■  ■  •  •  $y, #  (l‘ \1 ' *)(1  1 ; *). ..( 1  I *' )  &  a  \ 

&  (p  =  q  =  j i+  -+ jt)  &  !<*<«} 

L  j  can  be  accepted  by  a  two  queue  deterministic  machine  in  linear  time.  But  using 
the  techniques  in  Theorem  1  and  in  the  proof  of  one  deterministic  Turing  machine  tape 
requiring  square  time  for  this  language  (See  [LV]),  it  can  be  shown  that  Lx  requires 
fi (n2)  for  a  one  queue  deterministic  machine.  We  omit  the  proof. 

Remark.  The  above  lower  bounds  are  similar  to  the  case  of  one  tape  vs  two  tapes 
[Ma,LV].  However,  the  proofs  require  special  techniques  to  handle  the  queues.  Still  we 
do  not  have  a  lower  bound  as  good  as  in  the  nondeterministic  tape  case  [LV.GKS].  We 
feel  that  some  improvement  should  be  possible. 
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